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Abstract 


Severe acute respiratory syndrome coronavirus (SARS-CoV) is responsible for SARS infection. Nucleocapsid protein (NP) of 
SARS-CoV (SARS_NP) functions in enveloping the entire gnomic RNA and interacts with viron structural proteins, thus playing 
important roles in the process of virus particle assembly and release. Protein—protein interaction analysis using bioinformatics tools 
indicated that SARS_NP may bind to human cyclophilin A (hCypA), and surface plasmon resonance (SPR) technology revealed 
this binding with the equilibrium dissociation constant ranging from 6 to 160nM. The probable binding sites of these two proteins 
were detected by modeling the three-dimensional structure of the SARS_NP-hCypA complex, from which the important interaction 
residue pairs between the proteins were deduced. Mutagenesis experiments were carried out for validating the binding model, whose 
correctness was assessed by the observed effects on the binding affinities between the proteins. The reliability of the binding sites 
derived by the molecular modeling was confirmed by the fact that the computationally predicted values of the relative free energies 
of the binding for SARS_NP (or hCypA) mutants to the wild-type hCypA (or SARS_NP) are in good agreement with the data de- 
termined by SPR. Such presently observed SARS_NP-hCypA interaction model might provide a new hint for facilitating the un- 
derstanding of another possible SARS-CoV infection pathway against human cell. 
© 2004 Elsevier Inc. All rights reserved. 
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Between the end of 2002 and July 2003, a new epi- ronavirus, SARS coronavirus (SARS-CoV), was then 
demic disease called severe acute respiratory syndrome identified to be responsible for the infection [4-6]. Fol- 
(SARS) affected more than 30 countries [1-3]. A new co- lowing this, remarkable achievements have been made 

in genome sequencing of SARS-CoV [7,8], SARS pro- 
"2 Gomespondiaiatehons; Bass eae onesoao ree tein functional studies [9-11], three-dimensional struc- 
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drug discovery [12,19—21]. Although the SARS pandem- 
ic has been controlled by now, many scientific questions 
remain to be answered, such as which specific features of 
the virus are responsible for its pathogenicity, and which 
mechanisms are at work during infection. Furthermore, 
no vaccines or drugs are presently available for the pre- 
vention or therapy of SARS. Therefore, probing the in- 
fection mechanism at the molecular level is of utmost 
significance, since it is the foundation for discovering 
and developing anti-SARS drugs or vaccines. 

Mapping virus—host protein interactions can provide 
important clues on the initial stages of infection. Geno- 
mic sequence analyses revealed the genome organization 
and phylogeny of the SARS coronavirus [7,8]. The 
SARS-CoV genome contains 11 major open reading 
frames (ORFs) that encode, among others, the replicase 
polyprotein, the spike (S) protein, the small envelope (E) 
protein, the membrane (M) protein, and the nucleocap- 
sid (N) protein [7,8]. For coronaviruses, the N protein 
(NP) plays an important role during host cell entry 
and virus particle assembly and release [22-24]. In par- 
ticular, NP binds to a defined packaging signal on viral 
RNA, leading to the formation of the helical nucleocap- 
sid [23]. A similar function of NP has been found in 
other viruses, in addition to coronaviruses [25]. Howev- 
er, the mechanism by which NP facilitates virus replica- 
tion is not established. 

To gain insight into the function of SARS_NP in in- 
fection, we have used a bioinformatics approach to an- 
alyze potential interactions between SARS_NP and the 
proteins encoded in the human genome. We find strong 
evidence for an interaction between the SARS_NP and 
human cyclophilin A. Cyclophilins were discovered orig- 
inally for their high affinity against cyclosporin A (CsA), 
an immunosuppressive drug used to prevent allograft re- 
jection [26]. More recently, cyclophilins have been 
shown to play an important role in HIV infection. The 
Gag polyprotein of human immunodeficiency virus type 
1 (HIV-1) binds most members of the cyclophilin family 
of peptidyl-peptide-prolyl isomerases [27]. However, of 
the 15 known human cyclophilins, only human cyclophi- 
lin A (CypA) is integrated inside the viral core of HIV-1 
by interacting specifically with the capsid domain (CA) 
of the Gag polyprotein [28]. CypA subsequently per- 
forms an essential function in HIV-1 replication. The 
X-ray crystal structure of the complex between human 
CypA (hCypA) and the N-terminal domain of CA (res- 
idues 1-151) revealed the binding mode of these two 
proteins and provided a clear picture of the role that 
CypA plays in disassembly of the viral core [29]. More- 
over, it has been shown that maturation in the presence 
of CsA during virion budding causes a quantitative re- 
duction in viral infectivity [30]. 

Although no capsid protein is encoded in SARS-CoV 
genome [7,8], sequence alignment indicates that the seg- 
ment Val235-Pro369 of SARS_NP is homologous to 


HIV-1 CA with a sequence identity of 25.1% and a sim- 
ilarity of 36.7%. Encouraged by the evidence from the 
bioinformatics analysis and the knowledge of the role 
of CypA in HIV-1 infection, we presumed that 
SARS_NP may function just like the HIV-1 viral capsid 
domain, i.e., bind to human CypA (hCypA) during virus 
replication. 

We have verified the suspected SARS_NP-hCypA in- 
teraction by using surface plasmon resonance (SPR) 
biosensor technology, and the probable binding sites 
of these two proteins were detected by modeling the 
three-dimensional (3D) structure of the SARS_NP- 
hCypA complex, from which the important interaction 
residue pairs between the proteins were deduced. To val- 
idate the binding model, which involves the peptidyl- 
prolyl isomerase active site of hCypA, mutagenesis 
experiments were carried out with the two proteins, 
and the observed effects on the binding affinities were 
used to assess the correctness of the interaction model. 
The computationally predicted values of the relative free 
energies of binding of SARS_NP (or hCypA) mutants to 
the wild-type hCypA (or SARS_NP) are in good agree- 
ment with the data determined by SPR, indicating that 
the binding sites derived by molecular modeling are 
reliable. This finding may provide a new approach for 
discovering anti-SARS-CoV agents. 


Materials and methods 


Protein-protein interaction analysis by bioinformatics. A new 
method developed in our laboratory was applied to map the binding 
proteins of SARS_NP. In brief, the procedure of this method com- 
prised the following steps: (1) homology search was performed to 
identify the homologous proteins of SARS_NP. These proteins were 
selected as queries for searching the protein-protein interaction data- 
base DIP (http://www.DIP.org). (2) Proteins in the DIP database that 
possibly interact with SARS_NP homologous proteins were selected as 
the candidates of the SARS_NP binding proteins. (3) PFAM database 
(http://www.sanger.ac.uk/Software/Pfam/) was searched to obtain the 
domain distributions of the binding protein candidates. (4) Keywords 
search was performed on the PFAM and Interpro databases (http:// 
www.ebi.ac.uk/interpro/) to annotate the biological functions of the 
SARS_NP binding protein candidates. Proteins with functions related 
to immunosuppression, immunoregulation, and virus infection were 
picked out for further analysis and biological assay. 

Chemicals and enzymes. The restriction and modifying enzymes in 
this work were purchased from TaKaRa. The bacterial strains M15 
and DH5a, and the vector pQE30 were from Qiagen. TRIzol and 
Superscript II reverse transcriptase were purchased from Gibco. The 
chelating affinity column and lower molecular weight (LMW) marker 
were purchased from Amersham—Pharmacia Biotech. Isopropyl B-p- 
thiogalactoside (IPTG) was purchased from Promega. All other 
chemicals were from Sigma in their analytical grade. 

Plasmids and site-directed mutagenesis. All the recombinant DNA 
methods including PCR, restriction digestion, ligation, Escherichia coli 
transformation, and plasmid DNA preparation were performed ac- 
cording to the standard methods [31]. 

SARS-CoV (isolate BJO1) RNA was extracted with TRIzol reagent 
according to the manufacturer’s instruction (www.genehub.net/ 
trizol.htm). The reverse transcription was performed with the random 
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priming method by the Superscript II reverse transcriptase. The 
SARS_NP cDNA was subsequently amplified by PCR, using the fol- 
lowing primers: Fw, 5'ATTAGGATCCTCTGATAATGGACCCCA 
ATCA3’; and Rv, 5’'TTAAGTCGACTGCCTGAGTTGAATCAGC 
AGA 3’. After digestion with BamHI and SalI, the PCR product was 
inserted into the BamHI and Sail sites of the vector pQE30 to con- 
struct the plasmid pQE30-SARS_NP. The SARS_NP insert was veri- 
fied by sequencing. 

The site-directed mutational plasmids pQE30-SARS_NP-Q307A, 
PQE30-SARS_NP-Q307D, pQE30-SARS_NP-W302A, and pQE30- 
SARS_NP-I305A were obtained according to the Quick-change 
Mutagensis (Stratagene). 

The plasmid pQE30-CypA was kindly provided by the Center of 
Medical Laboratory Sciences, Nanjing PLA Hospital, Nanjing 210002, 
China. The site-directed mutational plasmids pQE30-CypA-R55A, 
PQE30-CypA-R55D, pQE30-CypA-Q63A, pQE30-CypA-W12IF, 
PQE30-CypA-R69A, and pQE30-CypA-R69K were obtained accord- 
ing to the Quick-change Mutagensis (Stratagene). 

Expression and purification of SARS_NP, hCypA, and mutants. 
SARS_NP wild-type protein was expressed and purified based on the 
following procedure. The recombinant plasmid pQE30-SARS_NP was 
transfected into M15 (DE3) bacterial strain. Clones were grown 
overnight in LB medium containing 100mg/L ampicillin and 25mg/L 
kanamycin. Expression of the His-tagged recombinant protein was 
induced at an ODeoo0 of 0.7-0.9 with the addition of isopropyl-B-p- 
thiogalactopyranoside (IPTG) to a final concentration of |1mM. After 
induction for 12h at 37°C, the cells were harvested by centrifugation 
for 30min at 40001r/m, 4°C and stored at —70°C. During the protein 
purification, cells were resuspended in 20 ml buffer A (20mM Tris-HCl 
pH 8.0, 500mM NaCl, 5mM imidazole, and 1mM PMSF) and then 
lysed by sonication for 15min in icy bath. The lysate was cleared by 
centrifugation at 4°C, 14,0001r/m for 60min. To the supernatant were 
added DNAase and RNAase to the final concentration of 2 1g/ml for 
either of them in order to get rid of the possible bound DNA and RNA 
scraps to the protein of interest. This mixture was incubated at 20°C 
for 3h before further treatments. 

The nuclease treated supernatant was loaded on a 4ml Sepharose 
Ni-NTA column (Amersham—Pharmacia) equilibrated with 30ml 
buffer A. The column was eluted with buffer B (20mM Tris-HCl, pH 
8.0, 500mM NaCl, and 500mM imidazole) after being washed by 
buffer C (20mM Tris-HCl, pH 8.0, 500mM NaCl, and 120mM im- 
idazole). The elution fraction was further applied to a gel filtration 
column (Hiprep 16/60 Sephacyl $100) on a FPLC (Amersham—Phar- 
macia) system. 

The expression and purification of the SARS_NP mutants 
(SARS_NP-Q307A, SARS_NP-Q307D, SARS_NP-W302A, and 
SARS_NP-I305A) in this work were carried out using the similar 
procedures for the SARS_NP wild-type protein. 

The procedures of expression and purification for hCypA wild-type 
and its mutants hCypA-R55A, hCypA-R55D, hCypA-Q63A, hCypA- 
WI121F, hCypA-R69A, and hCypA-R69K are similar to those for NP 
wild-type and its mutants except that for hCypA wild-type and its 
mutants: (1) the expression temperature is 25°C; (2) the concentration 
of IPTG is 0.5mM; and (3) the expression time is 7h. 

Protein-protein binding assay. The binding affinities of protein— 
protein interactions (hCypA to SARS_NP and SARS_NP mutant 
proteins, and SARS_NP to hCypA and hCypA mutant proteins) were 
determined emplying SPR technology based Biacore 3000 instrument 
(Biacore AB, Rapsgatan 7, S-754 50 Uppsala, Sweden). Immobiliza- 
tion of the protein to the CMS sensor chip (Biacore) was carried out by 
the standard primary amine coupling reaction. The protein to be co- 
valently bound to the matrix was diluted in 10mM sodium acetate 
buffer (pH 4.3) to a final concentration of 0.35mg/ml. Equilibration of 
the baseline was completed by a continuous flow of HBS-EP running 
buffer (10mM Hepes, 150mM NaCl, 3.4mM EDTA, and 0.005% (v/v) 
surfactant P20, pH 7.4) through the chip for 1—-2h. All the Biacore data 
were collected at 25°C with HBS-EP as running buffer at a constant 


flow of 20,l/min. All the sensorgrams were processed by using auto- 
matic correction for non-specific bulk refractive index effects. Lang- 
muir binding fitting model was used for estimating the Kp values, in 
which the association rate constant (k,,) and dissociation rate constant 
(Ko) are fitted simultaneously by rate Eq. (1), 

dR 

Gym Kon X CX (Rmax ~ R) ~ kom * R, (1) 
where R represents the response unit, C is the concentration of 
the ligands, and 


Kp = Kore /Kon- (2) 

Molecular modeling and free energy calculation. Based on the se- 
quence alignment (Fig. 2A), the 3D model of the segment Val235- 
Pro369 of SARS_NP was constructed by using Homology module of 
Insight Hf (Insight IH, 2000) and the MODELLER program [32] taking 
the 3D structure of capsid isolated from the X-ray crystal structure of 
CAj5;-hCypA complex (PDB entry 1AK4) as a template. After 
structural optimization, the 3D model of segment Val235-Pro369 was 
manually docked into the active site groove of the X-ray crystal 
structure of hCypA by taking loop Trp302-Pro310 as the SARS_NP 
binding loop. The entire complex structure was then optimized by 
using molecular mechanics method with AMBER force field and 
Kollman-all-Atom charges [33]. 

Complexes of wild-type hCypA (or SARS_NP) with mutated 
SARS_NP (or hCypA) were constructed based on the 3D model of 
SARS_NP-hCypA complex (Fig. 2B) by using Biopolymer module 
encoded in Sybyl 6.8 [34]. These complex models were solvated by a 
28A TIP3P [35] water sphere. Then the solvated models were equili- 
brated for 300ps at 300K by using molecular dynamics (MD) simu- 
lation, and a 100ps MD simulation was run on each model system, 
sampling the conformations for binding free energy calculation. All 
MD simulations were carried out by using the AMBER 7.0 program 
with AMBER force field (parm98) [33]. The MM/PBSA method [36] 
was used to evaluate the binding free energies of SARS_NP-hCypA 
complexes. In general, the average binding free energy of protein— 
protein interaction (averaged over 50 samples isolated from the MD 
trajectory), AGping, is calculated by Eq. (3): 


AGbina —_ Geomplex Gproteint 
= AEym + AGpg + AGyp, (3) 


Gprotein2 


where AFym is the interaction energy of the two proteins cal- 
culated by the AMBER force field, including internal, electro- 
static, and van der Waals components; AGpg is the polar 
solvation contribution calculated by using a finite-difference 
Poisson—Boltzmann (PB) model [37], and AGyp is the non-po- 
lar solvation energy, which is obtained from the solvent acces- 
sible surface area (SASA). The relative binding free energy, 
AAGbpina, 18 the difference between the binding free energy of 
a mutated SARS_NP (or hCypA) to wild-type hCypA (or 
SARS_NP) and that of wild-type SARS_NP to wild-type 
hCypA. 


Results and discussion 


Mapping _ interactions 


SARS_NP 


between host proteins and 


A homology search was performed between the 
SARS_CoV nucleocapsid protein (SARS_NP) and the 
yeast proteome, and 12 proteins homologous to 
SARS_NP were found. Proteins binding to these 
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SARS_NP homologous proteins possibly bind to 
SARS_NP. Therefore, taking these homologous pro- 
teins as queries, the protein-protein interaction database 
DIP [38] (http://www.DIP.org) was searched to identify 
potential SARS_NP binding proteins, and more than 
100 possible candidates were addressed. The biological 
functions of the binding protein candidates were 
checked using the annotation information in the PFAM 
and Interpro databases (http://www.ebi.ac.uk/interpro/). 
Keywords used in the protein functional check 
were “immunosuppression,” ‘“‘immunoregulation,” and 
“virus infection.” In this way, two potential SARS_NP 
binding domains related to cyclophilins [26] (PFAM ID 
PF00160) and human HIV-1 Rev-interacting protein 
(hRIP) [39] (PFAM ID PF01412) were identified. It is 
intriguing that both cyclophilins and hRIP are associat- 
ed with HIV-1 infection [27—29,39]. 


Validation of SARS_NP-hCypA interaction by SPR 
technology 


Surface plasmon resonance (SPR) measurements 
were used to determine the binding affinity of SARS_NP 
to hCypA. In separate experiments, the dissociation 
constant (Kp) of SARS_NP from hCypA immobilized 
on the CMS sensor chip and the dissociation constant 
(Ky) of hCypA from immobilized SARS_NP were de- 
termined. The binding responses in resonance units 
(RUs) were continuously recorded and presented graph- 
ically as a function of time. The association could be de- 
scribed by a simple equilibrium of the A+B <=> AB 
type (A, analyte; B, ligand; and AB, complex). To deter- 
mine the equilibrium dissociation constant for the inter- 
action, the equilibrium response (R,,) data were fit to an 
independent binding-site model, 


Rmnax,i x Cx Kaj 
Re = ’ 4 
: d. 14+Cx Ka; ”) 


where Rmax stands for the maximal response, C is the 
concentration of injection sample, and Kg is the associ- 
ation constant. For a single-site interaction, i= 1, for 
two-site binding, i = 2, and so on. The results are shown 
in Fig. 1 and listed in Table 1. For the first determina- 
tion (with hCypA immobilized on the chip), the rate as- 
sociation constant (kon) was 1.17 x 10*M~'s~!, the rate 
dissociation constant (kg) was 7.10 x 10-°s~', and the 
equilibrium dissociation constant (Kp) was estimated 
(konlkon) as 6.04 x 10~-°M. For the second determination 
(with SARS_NP immobilized on the chip), the rate asso- 
ciation constant (k/,,) and the rate dissociation constant 
(k’) between the two proteins were 854M~'s~! and 
1.36 x 10-48", respectively, and the equilibrium dissoci- 
ation constant (Ki) was 1.59 x 10-’M. This indicates 
that SARS_NP binds hCypA with a high affinity, much 
higher than the binding affinity between HIV-1 CA and 
hCypA, ~16uM [40]. Furthermore, the sensorgrams 
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Fig. 1. Surface plasmon resonance analysis of SARS_NP-hCypA 
binding. (A) Sensorgram for SARS_NP binding to an hCypA surface 
on the CMS sensor chip. Binding responses are shown for SARS_NP 
injected at concentrations of 31.25, 62.5, 125, 250, 500, and 1000nM. 
(B) Sensorgram for hCypA binding to a SARS_NP surface on the 
CMS sensor chip. Binding responses are shown for hCypA injected at 
concentrations of 0.312, 1.25, 2.5, 5.0, 10.0, 12.0, and 20.0 uM. 


(Fig. 1) and the ko, and ko values (Table 1) revealed 
that both the association and dissociation between 
SARS_NP (or hCypA) and hCypA (or SARS_NP) are 
slow processes. 


Identification of binding sites 


Encouraged by the experimental verification of the 
binding between SARS_NP and hCypA with high affin- 
ity, we made an attempt to identify the responsible bind- 
ing sites on both proteins. This was achieved by 
constructing a three-dimensional (3D) model for the 
SARS_NP-hCypA complex. At present, the X-ray crys- 
tal structure of the complex between human cyclophilin 
A (hCypA) and the N-terminal domain (residues 1-151) 
of HIV-1 capsid protein (CA,51) is one of the experimen- 
tal structures for hCypA in complex with another pro- 
tein [29,41]. Sequence alignment suggested that the 
segment Val235-Pro369 of SARS_NP is homologous 
to HIV-1 CA (Fig. 2A), with an identity of 25.1% and 
a similarity of 36.7%. Accordingly, we deduced that 
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Table | 
Binding affinities of wild-type hCypA to SARS_NP and its mutants and wide-type SARS_NP to hCypA and its mutants 
NP mutation Kon (M"! s') Kow (s-') Kp (nM) AAGbina (kcal/mol) 

Experimental* Predicted? 
hCypA binding affinity 
Wild type 1.17 + 0.005 x 10* 7.10 + 0.032 x 107° 6.04 0.0 0.0 
Trp302Ala 1.43 + 0.053 x 10* 3.41 + 0.088 x 1077 238 2.18 2.00 
Tle305Ala 7.67 + 0.094 x 10° 9.86 + 0.075 x 107+ 129 1.81 0.50 
Gln307Ala 1.59 + 0.032 x 10* 1.99 + 0.093 x 107+ 12.6 0.44 3.95 
G1n307Asp 1.12 + 0.088 x 10* 2.25 + 0.026 x 107+ 20.1 0.71 11.75 
hCypA mutation Kk, (M7's7!) Kop (8) Kb (nM) AAGbina (kcal/mol) 

Experimental® Predicted? 
SARS_NP binding affinity 
Wild type 854 + 24.5 1.36 + 0.066 x 107+ 159 0.0 0.0 
Arg55Ala 715 + 4.33 5.14+ 0.018 x 10-* 719 0.90 1.13 
Arg55Asp 4.74 + 0.019 x 10° 1.02 + 0.094 x 1073 215 0.18 4.02 
Gin63Ala 7.31 + 0.087 2.72 + 0.018 x 107+ 37,200 3.23 4.95 
Trp121Phe 7.99 + 0.096 2.11 + 0.012 x 10-4 26,300 3.03 0.78 
Arg69Ala 5.10 + 0.028 x 10° 3.80 + 0.076 x 10-* 74.6 —0.45 —0.89 
Arg69Lys 123 + 2.53 3.89 + 0.039 x 107+ 3160 1.77 0.48 


* Relative binding free energies of hCypA-SARS_NP mutants binding relative to the hCypA-SARS_NP binding calculated from the SPR 


dissociation constants (Kps). 


> Predicted relative binding free energies of hCypA-SARS_NP mutants binding relative to the hCypA-SARS_NP binding calculated by the MM/ 


PBSA method. 


© Relative binding free energies of SARS_NP-hCypA mutants binding relative to the SARS_NP-hCypA binding calculated from the SPR 


dissociation constants (K'ps). 


4 Predicted relative binding free energies of SARS_NP-hCypA mutants binding relative to the SARS_NP-hCypA binding calculated by the MM/ 


PBSA method. 


the binding site of SARS_NP to hCypA is located with- 
in this segment. We constructed a 3D model for the 
SARS_NP-hCypA complex by taking the crystal struc- 
ture of the hCypA—CA;s5; complex [29] (PDB entry 
1AK4) as a template. First, a 3D model of the segment 
Val235-Pro369 of SARS_NP was generated by homolo- 
gy modeling based on the 3D structure of HIV-1 CA,5; 
in complex with hCypA-CA [29]. Further domain anal- 
ysis indicated that the SARS_NP segment Val235- 
Pro369 is a special domain, which belongs to the SCP 
(sterol carrier protein) protein family. The modeled 3D 
structure of SARS_NP segment Val235-Pro369 was 
docked manually into the active site of hCypA, allowing 
loop Trp302-Pro310 of NP, which aligns well with HIV- 
CA Pro85-Pro93, to lie in the active-site groove of hCy- 
pA in an extended conformation. The overall structural 
model of the SARS_NP(235-369)-hCypA complex is 
shown in Fig. 2B. 

Structurally, SARS_NP loop Trp302-Pro310 fits well 
into the active-site groove of hCypA in a way similar to 
CA loop Pro85-Pro93 (Fig. 2C). Several important in- 
teractions can be deduced from the binding model 
(Fig. 3). Briefly, two hydrogen bonds form between 
the side-chain of hCypA Arg55 (N™ and N) and 
SARS_NP Gin307 (O*! and O); the N atom of hCypA 
Gln63 hydrogen bonds to the carbonyl O atom of 
SARS_NP Ile305; the N* atom of hCypA Trp121 forms 


a hydrogen bond to the carbonyl oxygen of SARS_NP 
Phe308; and the carbonyl oxygens of Gly72 and 
Asn1l02 of hCypA accept hydrogen bonds from the 
main-chain amides of SARS_NP Ile305 and Ala306, re- 
spectively (Fig. 3). Interestingly, the 3D model demon- 
strates that a cation—m interaction [42] exists between 
hCypA Arg69 and SARS_NP Trp302, and a CH: --1 hy- 
drogen bond [43,44] between the y-CH» group of 
Gin307 and hCypA Phel33 (Fig. 3). In addition, exten- 
sive hydrophobic contacts exist across the SARS_NP-— 
hCypA interface. 


Validation of the binding model by site-directed muta- 
genesis 


The modeled structure described above indicated that 
SARS_NP-hCypA complex formation occurs mainly 
between the SARS_NP loop Trp302-Pro310 and the ac- 
tive site of hCypA (Fig. 2C). To validate this binding 
model, each residue of both proteins that, according 
to our model, makes an essential contribution to com- 
plex formation (Fig. 3), was mutated and the effect of 
the mutation on the binding energy was predicted com- 
putationally as well as measured experimentally. Bind- 
ing affinity measurements were performed by SPR 
technology, and the free energy changes relative to the 
wild-type protein-protein interaction of the mutations 
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Fig. 2. Structure model for SARS_NP-hCypA complex. (A) The sequence alignments between HIV-1 capsid protein (1-231) and SARS_NP. 
Asterisks label the identical or conserved residues, colons designate the conserved substitutions, and dots point out the semi-conserved substitutions. 
Alignment was carried out by the program CLUSTALW 1.81 [47]. (B) Schematic representation of the 3D model of SARS_NP(235-369)-hCypA 
complex. The binding loop Trp302-Pro310 was highlighted in red color. Residues shown in ball-and-stick model are essential in the SARS_NP- 
hCypA binding, and site-directed mutagenesis were performed on these residues to validate this binding model. This picture was generated using the 
MolScript program [48]. (C) Binding of SARS_NP loop Trp302-Pro310 (shown as CPK model) to the hCypA active-site groove. The hCypA is 
rendered by electrostatic surface calculated by MOLCAD program encoded in Sybyl 6.8 [34]. (For interpretation of the references to colour in this 


figure legend, the reader is referred to the web version of this paper.) 


were calculated using MM/PBSA [36]. The results are 
listed in Table 1. During the SPR determination, wild- 
type hCypA was immobilized on the chip in the assay 
for hCypA binding to mutated SARS_NP proteins; 
while in the assay for SARS_NP binding to the mutated 
hCypA proteins, wild-type SARS_NP was immobilized 
on the chip. Overall, the mutational analyses revealed 


that SARS_NP loop Trp302-Pro310 is the key determi- 
nant for hCypA recognition, and that this loop binds to 
the active-site groove of hCypA, probably exactly as 
predicted by our model (Figs. 2 and 3). 

For the SARS_NP protein, the mutational analyses 
revealed that Trp302 is a major determinant of hCypA 
recognition, since mutation of Trp302 to Ala reduced 
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Fig. 3. Hydrogen bonds (green dot lines), cation—1 (blue dot line), and 
-CH. --m hydrogen bond (red dot line) interactions between SARS_NP 
and hCypA. Residues of SARS_NP are represented as ball-and-stick 
model, and residues of hCypA are represented as stick model. Atoms 
are rendered by colors of atom types: carbon gray, nitrogen blue, and 
oxygen red. The two green balls represent the centers of benzene rings 
of Phel13 and Trp121, respectively. Distances are in Angstrom. This 
picture was generated using the MolScript program [48]. (For 
interpretation of the references to colour in this figure legend, the 
reader is referred to the web version of this paper.) 


the hCypA binding by 2.18 kcal/mol, which agrees well 
with the computational predicted value of relative free 
energy, 2.0kcal/mol. The mutation Trp302Ala abolishes 
the cation—n interaction between the indole ring of the 
tryptophane and the side-chain of hCypA Arg69 (Fig. 
3). The SARS_NP-hCypA binding model predicts a 
contact between the side-chain of Ile305 and hCypA 
Asn102. Indeed, replacement of Ile305 by Ala decreases 
the binding affinity by 1.81 kcal/mol (Table 1). This is al- 
so in agreement with the computational value of 
0.55 kcal/mol. Mutation of Gln307 to Ala abolishes the 
hydrogen bond formed between this residue and hCypA 
Arg55 (N") and also the C-H- -- hydrogen bond with 
hCypA Phel13 (Fig. 3), thereby reducing the hCypA 
binding affinity by about 0.44kcal/mol. A decrease of 
the latter, albeit of larger magnitude (3.95kcal/mol), 
had also been predicted by the theoretical calculations. 
To confirm the C-H---m hydrogen bond, we performed 
a mutation of Gln307 to Asp. The negatively charged 
side-chain of Asp creates a repulsive interaction with 
the z electrons of the benzene ring of hCypA Phel33, 
thus decreasing the hCypA binding affinity by about 
0.71 kcal/mol. 

In addition to SARS_NP residues predicted to be im- 
portant by the binding model, we also mutated relevant 
hCypA residues. Mutation of Arg55 to Ala disrupts the 
two hydrogen bonds formed by the side-chain with the 
carbonyl O and side-chain O*' atoms of Gln307 (Fig. 
3), respectively, which reduces the binding affinity (as 
determined by SPR) by about 0.90 kcal/mol. Mutation 
of Gln63 to Ala abolishes the hydrogen bond between 
its side-chain and the carbonyl O atom of [e305, thus 


decreasing the SARS_NP binding affinity by about 
3.23 kcal/mol. Trp121 contributes to SARS_NP binding 
by forming an intermolecular hydrogen bond between 
its indole NH and SARS_NP Phe308 O atom. Mutation 
of this tryptophane to Phe decreases the SARS_NP 
binding affinity by 3.03 kcal/mol. Additionally, mutation 
of Arg69 to Lys was also performed, which decreases the 
SARS_NP binding affinity by 1.77kcal/mol. This result 
indicated again that a cation—n interaction exists be- 
tween hCypA Arg69 and SARS_NP Trp302 (Fig. 3). 
For the above mutations, the free energy changes for 
SARS_NP binding to hCypA mutant proteins, relative 
to SARS_NP binding to wild-type hCypA, were also 
calculated by using the computational method, and all 
of these were found to be well in agreement with the ex- 
perimental results (Table 1). 

Mutation of hCypA Arg69 to Ala was performed to 
abolish the cation—n interaction with SARS_NP (Fig. 
3). However, SPR determination indicated that this mu- 
tation increased the binding affinity by about 0.45 kcal/ 
mol, and free energy prediction basically agreed with 
this (AAG bing = 0.89 kcal/mol). The SARS_NP-hCypA 
binding model indicated that this cation—n interaction 
is located at the rim of the protein-protein interface 
and may be surrounded by solvent molecules (Fig. 
2B). Molecular dynamics simulations (unpublished re- 
sult) indicated that there are many water molecules in 
the SARS_NP-hCypA interface due to the attraction 
of the cationic side-chain of Arg69. Substitution of 
Arg69 by Ala tends to prevent water molecules from en- 
tering the protein-protein interface due to the hydro- 
phobicity of Ala and Trp302, which is beneficial to the 
SARS_NP-hCypA binding. 

Binding affinity changes caused by the described 
mutations can be rationalized energetically by the 3D 
model of SARS_NP-hCypA complex, and the experi- 
mentally determined relative binding free energies 
(AAGpina) are largely in agreement with the predicted 
values resulting from the calculations based on the 3D 
model (Table 1). The SARS_NP-hCypA binding model 
therefore clearly rationalizes why changing the side- 
chains of some residues is energetically unfavorable or 
favorable. This demonstrated further the reliability of 
the interaction model. 


Significance of SARS_NP-hCypA interaction for SARS- 
CoV infection 


Although cyclophilins, especially CypA, have been 
studied extensively, the functions of this family of pro- 
teins are not known with certainty [27]. Recently, the 
significance of the Gag—hCypA interaction for the retro- 
viral life cycle has been demonstrated [27—30]. Evidence 
for Gag—-hCypA interaction during HIV-1 virus infec- 
tion comes from the observation that cyclosporin 
(CsA), an inhibitor of cyclophilin, blocks viral infection 
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if present at the time of virus entry into host cells, and 
leads to a decrease of the yield of infectious particles re- 
leased if added to infected cells [30,45,46]. 

In conclusion, we have found in the present study that 
the NP of SARS-CoV binds hCypA with high affinity; 
the equilibrium dissociation constant (Kp) ranges from 
6 to 160nM, which is 100-2700 times higher than the 
binding affinity of HIV-1 CA to hCypA. Sequence align- 
ment and molecular modeling revealed that SARS_NP 
segment Val235-Pro369 may interact with hCypA (Fig. 
2); more precisely, SARS_NP loop Trp302-Pro310 fits 
into the active-site groove of hCypA through hydrogen 
bonding, cation-t and -CH---a hydrogen bonding 
interactions (Figs. 2C and 3). The putative binding sites 
have been verified by site-directed mutagenesis and de- 
termination of the binding affinity (Table 1). Hopefully, 
this observed SARS_NP-hCypA interaction might pro- 
vide a new hint for the understanding of a possible 
SARS-CoV infection pathway against human cell, and 
further supply a feasible approach for anti-SARS agent 
screening. 
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